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The spike (S) glycoprotein of coronaviruses mediates 
viral entry into host cells. It is a type 1 viral fusion 
protein that characteristically contains two heptad re- 
peat regions, denoted HR-N and HR-C, that form coiled- 
coil structures within the ectodomain of the protein. 
Previous studies have shown that the two heptad repeat 
regions can undergo a conformational change from 
their native state to a 6-helix bundle (trimer of dimers), 
which mediates fusion of viral and host cell membranes. 
Here we describe the biophysical analysis of the two 
predicted heptad repeat regions within the severe acute 
respiratory syndrome coronavirus S protein. Our re- 
sults show that in isolation the HR-N region forms a 
stable a-helical coiled coil that associates in a tet- 
rameric state. The HR-C region in isolation formed a 
weakly stable trimeric coiled coil. When mixed together, 
the two peptide regions (HR-N and HR-C) associated to 
form a very stable a-helical 6-stranded structure (trimer 
of heterodimers). Systematic peptide mapping showed 
that the site of interaction between the HR-N and HR-C 
regions is between residues 916-950 of HR-N and resi- 
dues 1151-1185 of HR-C. Additionally, interchain disul- 
fide bridge experiments showed that the relative orien- 
tation of the HR-N and HR-C helices in the complex was 
antiparallel. Overall, the structure of the hetero- 
stranded complex is consistent with the structures ob- 
served for other type 1 viral fusion proteins in their 
fusion-competent state. 


In the Fall of 2002, the World Health Organization witnessed 
an outbreak of atypical pneumonia, termed severe acute respi- 
ratory syndrome (SARS),! that quickly spread to >25 countries 
world wide. The causative agent of SARS was quickly identified 
to be a previously unknown member of the family of coronavi- 
ruses (1—4). The coronaviruses are a diverse group of large 
enveloped, positive-stranded RNA viruses that cause respira- 
tory and enteric diseases in humans and other animals. For 
example, previously known human coronaviruses, HCoV-229E 
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and HCoV-OC43, cause up to 30% of common colds and rarely 
cause pneumonia in older adults or immunocompromised pa- 
tients (5-7). Coronaviruses of animals (e.g. porcine transmissi- 
ble gastroenteritis virus, murine hepatitis virus (MHV), and 
avian infectious bronchitis virus) cause respiratory, gastroin- 
testinal, neurological, or hepatic disease in their respective 
hosts (8). In the case of the SARS-associated coronavirus 
(SARS-CoV), the nucleotide sequence shows closest identity to 
novel coronavirus strains isolated from Himalayan palm civets 
and a raccoon dog in the Guandong province of China (9, 10). 
The human isolate of SARS-CoV can also grow in monkeys, 
mice, cats, and ferrets. 

Coronavirus infection is achieved through fusion of the lipid 
bilayer of the viral envelope with host cell membranes. Mem- 
brane fusion is mediated by the viral spike (S) glycoprotein on 
the viral envelope (11-14). The S glycoprotein is synthesized as 
an ~180-kDa precursor that oligomerizes in the endoplasmic 
reticulum and is incorporated into budding virions in a pre- 
Golgi compartment. In some strains, S is cleaved by trypsin or 
related enzymes to yield two non-covalently associated sub- 
units, S1 and $2 (15, 16). S1 contains the receptor-binding site 
and thus defines the host range of the virus (17, 18). S2 is the 
transmembrane subunit that mediates fusion between viral 
and cellular membranes. S2 contains two 4,3-hydrophobic re- 
peat domains (HR) that are predicted to form coiled-coil struc- 
tures (14, 19). These regions are denoted HR-N and HR-C and 
are separated by an intervening stretch of ~140 amino acid 
residues called the interhelical domain. These coiled-coil re- 
gions are thought to play an important role in defining the 
oligomeric structure of the spike protein in its native state and 
its fusogenic ability (20). Furthermore, the presence of the HR 
regions in conjunction with recent studies by Bosch e¢ al. (19) 
indicate that coronavirus spike proteins can be classified as 
type 1 viral fusion proteins. 

Binding of S protein of murine coronavirus MHV to a specific 
soluble or cell surface glycoprotein receptor induces global 
changes in the conformation of S protein that converts S pro- 
tein from a native pre-fusogenic state to a fusion active state 
(Fig. 1). In the case of the MHV-JHM strain, interaction with 
the host cell receptor CEACAM1 causes the S1 domain to 
readily dissociate from the S1/S2 complex, resulting in display 
of a new hydrophobic surface area that allows the virions to 
bind to liposomes and exposes previously hidden trypsin cleav- 
age sites in S2 (21-23). This receptor-induced conformational 
change is believed to facilitate the release of a hydrophobic 
fusion peptide from the interior of S2 and position it to interact 
with the host cell membrane. Next, the fusion active state of 
type 1 viral fusion proteins resolves itself to a fusogenic or 
post-fusion state (24). This latter change is thought to arise 
from an association between the two coiled-coil regions within 
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Fic. 1. Schematic representation of 
the three different states of the coro- 
navirus S fusion protein during viral 
entry. State 1 (native), the S protein is 
denoted S1 and S2 for its N- and C-termi- 
nal domains; state 2 (intermediate state), 
the N-terminal (S1) domain is dissociated SI 
(shedding) to expose the fusion peptide 
(FP) region, and state 3 (collapsed 6-helix 
bundle or fusion active state), the col- 
lapsed S2 domain draws the viral and 82 
cellular membranes together causing fu- 
sion and release of the viral nucleocapsid 
into the host cell. HR-N and HR-C denote 
coiled coils at the N terminus and C ter- 
minus of the S2 domain. X denotes the 
host cell surface receptor. 


the $2 subunit, in which the N-terminal HR region from three 
S proteins forms a parallel triple-stranded coiled coil and the 
interhelical polypeptide chain loops around to reverse the 
polypeptide chain direction, positioning the HR-C region anti- 
parallel to HR-N. The residues at the a and d positions (of the 
heptad repeat positions denoted (abcdefg),,) of HR-C then pack 
into grooves formed by the residues at the a, d, e, and g 
positions of the HR-N core coiled coil to complete the 
6-stranded a-helical bundle structure also termed “trimer of 
hairpins” or “trimer of dimers.” This structure ultimately 
draws the viral and cellular membranes close together, desta- 
bilizing the lipid bilayers that surround the virus and target 
cell, causing fusion and release of the viral nucleocapsid into 
the host cell. The high resolution structures of the ectodomains 
of type 1 viral fusion proteins from many enveloped viruses 
show very similar structures in the fusogenic state, suggesting 
a general membrane fusion mechanism (25-36). 

Because fusion mediated by several type 1 viral fusion pro- 
teins, e.g. human immunodeficiency virus, gp160, can be inhib- 
ited by peptide mimics of the HR-N or HR-C regions (19, 
37—46), and there is a pressing need for effective anti-viral 
therapies to treat SARS, we have carried out a detailed study of 
the biophysical properties of the HR regions within the ectodo- 
main of the SARS-CoV S glycoprotein. We demonstrate that 
the HR-N and HR-C regions of the SARS-CoV S glycoprotein 
can independently form a-helical coiled-coil structures. Fur- 
thermore, we show that a mixture of HR-N and HR-C can form 
a very stable trimer of dimers structure similar to other type 1 
viral fusion proteins with the two helical HR-N and HR-C 
regions running anti-parallel to each other. Systematic peptide 
mapping has localized the site of interaction to residues 916— 
950 within the HR-N domain and 1151-1185 in the HR-C 
domain. 


MATERIALS AND METHODS 


Peptide Synthesis—The HR-N and HR-C peptides of SARS CoV S 
glycoprotein were prepared by solid-phase synthesis methodology using 
4-benzylhydrylamine hydrochloride resin with conventional N-t-bu- 
tyloxycarbonyl chemistry as described by Tripet et al. (47). Peptides 
were N-terminally acetylated, cleaved from the resin, and purified by 
reversed-phase high performance liquid chromatography (HPLC) to 
homogeneity and characterized by amino acid analysis and electrospray 
mass spectrometry. 

Plasmid Construction—Production of the peptide corresponding to 
SARS S protein amino acids 882-973 (HR-N1) was done using direc- 
tional sub-cloning and bacterial expression techniques. PCR fragments 
were prepared from the plasmid SARS S number 18 (containing the 
entire S protein from SARS, Urbani strain (GenBank™ AY278741)). 
Primers were designed to incorporate an Ndel restriction site upstream 
(GTACGTACGCATATGATGCAAATGGCATATAGGTTC) and an 
EcoRI site downstream (GCGAATTCCCTTTGTCGTCGTCGTCGCAG- 
CCGCCTACCTCCGCCTCGACTTTATCAAG) of the amplified frag- 
ment. The NdeI site contains an ATG start codon just upstream of the 
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SARS sequence. The reverse primer was engineered to incorporate the 
amino acid sequence Gly-Gly-Cys-Asp-Asp-Asp-Asp-Lys to insert an 
enterokinase site (DDDDK) just downstream of the synthesized se- 
quence and just upstream of the EcoRI site. The nucleotide fragment 
corresponding to residues 882-973 was amplified by PCR and sub- 
cloned into the EcoRI/Ndel site of the pT7SH6 plasmid, in-frame with 
the His, tag directly downstream of the EcoRI site to create the plasmid 
pT7SH6 SARS-(882-973). pT7SH6 contains an oligonucleotide-encoded 
His, tag followed by two stop codons cloned into the EcoRI-BamHI sites 
of pT7.7 so that cloning into the EcoRI site creates a C-terminal protein 
fusion encoding NSHHHHHHXxX. Plasmid pT7.7 (based on the plas- 
mids of Tabor and Richardson (48)) enables genes to be expressed by T7 
RNA polymerase using the highly efficient translation signals of T7 
gene 10 by cloning into an NdeI (CATATG) site where the ATG is the 
initiation codon. Plasmid SARS S number 18 was created by cloning the 
full-length SARS S protein into pcDNA3.1/V5-His TA cloning vector 
(Invitrogen). The SARS S sequence was amplified using reverse tran- 
scription from SARS genomic RNA (kindly provided by W. Bellini under 
a material transfer agreement with the Centers of Disease Control, 
Atlanta, GA), followed by PCR using primers F21488 (CACCATGTT- 
TATTTTCTTATTATTT) and R25259STOP (TTATGTGTAATGTAATT- 
TGACACC). The reverse primer contains a stop codon to halt transcrip- 
tion before the V5-His, tag in the cloning vector. 

Bacterial Protein Expression and Purification—BL21 AI Escherichia 
coli cells (Invitrogen) were transformed with pT7SH6 SARS-(882-973). 
These cells were grown overnight in Luria Broth (LB) containing am- 
picillin (Sigma). The overnight culture was used to inoculate a flask of 
LB, containing ampicillin, and the cells were grown to an Agog of 0.7. 
The cells were induced by adding 10% L-arabinose (Sigma) to a final 
concentration of 0.2%. Three hours later the cells were pelleted, resus- 
pended in 1/10th volume of 20 mm Tris/HCl, pH 8.0, and protease 
inhibitors (Complete EDTA-free, Roche Applied Science). The sample 
was frozen at —80 °C and thawed at 50 °C three times. 1 mg of DNase 
(Promega, Madison, WI) was added to the sample and incubated at 
room temperature for 20 min. The sample was centrifuged at 14,000 x 
g for 10 min. The soluble supernatant medium was purified with Hi- 
Trap nickel affinity columns (Amersham Biosciences) as described pre- 
viously (49). The column was washed with 20 mm Tris/HCl, pH 8.0, and 
bound proteins were eluted with a 0-500 mm gradient of imidazole 
(Sigma) in 20 mm Tris/HCl, pH 8.0. Fractions containing the SARS- 
(882-973) (HR-N1) peptide were identified using immunoblot analysis 
with an anti-His, polyclonal antibody (AbCam, Cambridge, MA). These 
fractions were pooled and placed over a reversed phase chromatography 
column for purification. 

Circular Dichroism Spectroscopy—Circular dichroism spectra were 
recoded on a Jasco J-810 spectropolarimeter (Jasco Inc., Easton, MD). 
The CD wave scans were measured from 190 to 255 nm in benign buffer 
(0.1 M KCl, 0.05 mM PO,, pH 7). For samples containing trifluoroethanol 
(TFE), the above buffer was diluted 1:1 (v/v) with TFE. Temperature 
denaturation midpoints (T1,) for the peptides were determined by fol- 
lowing the change in molar ellipticity at 222 nm from 4 to 95 °C ina 
1-mm path length cell and a temperature increase rate of 1 °C/min. 
Ellipticity readings were normalized to the fraction of peptide folded (f;) 
or unfolded (f,,) using the standard equations f; = ([6] — [6],,)(L6],, — [6],,) 
and f, = (1 — f,); where [6],, and [6],, represent the ellipticity values for 
the fully folded and fully unfolded species, respectively. [6] is the ob- 
served ellipticity at 222 nm at any temperature. 

Sedimentation Analysis—Sedimentation equilibrium experiments 
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Fic. 2. Coiled-coil prediction analy- 
sis of the SARS-CoV S glycoprotein. 
A, plot of the predicted stability versus 
sequence position for the SARS-CoV S 
protein using the coiled-coil prediction al- 
gorithm STABLECOIL (53) with a 35-res- 
idue window width. The two regions of 
interest to this study are denoted HR-N 
(for N-terminal heptad repeat) and HR-C 
(for C-terminal heptad repeat). B, sche- 
matic representation of the relative loca- 
tions of the predicted coiled-coils HR-N 
and HR-C within the S protein of the 
SARS-CoV. The location of the predicted 
transmembrane spanning region and fu- B 
sion peptide region are also shown. HR-N 
can be divided into three sub-regions (re- 
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were performed on the analytical ultracentrifuge (XLA from Beckman- 
Coulter) as described by Tripet et al. (47). In general HR-N and HR-C 
peptides were dialyzed exhaustively against 50 mm K,HPO,, 100 mm 
KCl, pH 7 buffer at 4 °C. For full-length HR-N1 peptide, HR-N1 was 
dialyzed against the low salt containing buffer 50 mm K,HPO,, pH 6, for 
enhanced solubility. Three 100-yl aliquots of the sample were loaded 
into a 12-mm 6-sector, charcoal-filled Epon cell and centrifugation 
proceeded for 48 h with rotor speeds of 20,000, 26,000, and 30,000 rpm. 
The HR-N and HR-C peptides were each loaded at initial total peptide 
concentrations of 50, 100, and 250 um, and the full-length HR-N1 
peptide was loaded at 10, 20, and 50 um. The partial specific volume of 
each peptide was calculated using the program SednTerp version 1.08 
(50). The density of the solvent was calculated to be 1.009 g/ml. The data 
were evaluated using a non-linear least squares curve-fitting algorithm 
contained in the WinNonLin analysis program version 1.06 (51). 

Gel Electrophoresis—HR-N and HR-C region peptides singly or as an 
equimolar mixture (200 ym of each peptide) were dissolved in 100 mm 
KCl, 50 mm PO,, pH 7 buffer and incubated at room temperature. 
Samples were then diluted with 1 volume of 2X Laemmli sample buffer 
and analyzed by SDS-PAGE in a 15% Tris/glycine gel. 

Size-exclusion and Reversed-phase Chromatography—HR-N and 
HR-C region peptides singly or as an equimolar mixture (200 uM of each 
peptide) were dissolved in 100 mm KCl, 50 mm PO,, pH 7, and equili- 
brated at room temperature for 30 min. A 10-yl aliquot of the mixture 
was loaded onto a high performance size-exclusion column, Superdex 
75™ (1 x 30 cm, Amersham Biosciences) equilibrated in a buffer 
consisting of 50 mm PO,, 100 mm KCl, pH 7, at a flow rate of 0.75 ml/min 
and ambient temperature. For peptides that formed stable hetero- 
stranded complexes, the complex peak was collected and analyzed by 
reversed phase chromatography on an analytical C8 column (Zorbax 
300SB-C8, 15 cm X 4.6 mm inner diameter, 6.5 uM particle size, 300-A 
pore size; Agilent Technologies). The peptides were eluted from the 
column by employing a linear AB gradient of 2% B/min, where eluent A 
is 0.05% aqueous trifluoroacetic acid and eluent B is 0.05% trifluoro- 
acetic acid in acetonitrile at a flow rate of 1.0 ml/min at room temper- 
ature. To calculate the peptide ratio in the complex, the peak areas of 
each component were compared with peak areas of known standard 
solutions of each peptide. 

Formation of Parallel and Anti-parallel Disulfide-bridged Hetero 
Two-stranded Molecules—Preferential disulfide bridge formation be- 
tween the HR-N9 and HR-C1 regions was performed similar to that 
described by Semchuck e¢ al. (52). In brief, 10 mg of 2,2'-dithiopyridine 
was dissolved in 100 pl of dimethylformamide with sonication. A 10-pl 
(3.4 wmol) aliquot of this solution was added to a solution of the HR-N9 
peptide (2 mg, 0.8 pmol) dissolved in a 3:1 (v/v) acetic acid/H,O buffer 


and stirred for 6 h. The solution was then extracted three times with 
500 pl of ether, and the aqueous layer was applied to a Sephadex G-25 
desalting column conditioned with 50 mm NH, acetate, pH 5.5 running 
buffer. Fractions (1 ml) eluted from the column were collected, and 
those with 220 nm absorbance were pooled. To one-half of the pooled 
solution was added HR-C1 peptide with either an N-terminal CGG or a 
C-terminal GGC linker (1 mg of dissolved in 1 ml of 50 mm NH,Ac, pH 
5.5 buffer) in 100-1 aliquots over 30 min. The reactions were then 
stirred for 1 h, and the final complex was purified by reversed-phase 
HPLC and freeze-dried. 


RESULTS 


Sequence Analysis—To identify the heptad repeat (HR) re- 
gions within the SARS-CoV S protein, we utilized the coiled- 
coil prediction algorithm STABLECOIL (53). Fig. 2A shows the 
graphical output of the analysis using a 35-residue window 
width. The program predicts two coiled-coil regions: the HR-N 
coiled coil (residues 882-1011) and the HR-C coiled coil (resi- 
dues 1147-1185) with an interhelical domain of ~140 amino 
acid residues between HR-N and HR-C. The location of these 
regions appears to be consistent with other type 1 viral fusion 
proteins, in that the HR-C region is located adjacent to the 
predicted transmembrane region (residues 1194-1226), and 
the HR-N region is located C-terminal to a region of high 
sequence similarity to the fusion peptide (residues 851-882) 
(Fig. 2B). The coiled-coil prediction analysis also indicates that 
the HR-N region can be further sub-divided into three regions 
(regions 1—8) based on the assignment of the heptad a and d 
positions (data not shown). The region where the heptad reg- 
isters change is denoted as a frameshift/hinge region (Fig. 2B). 
The program also predicts a second (lower scoring) heptad 
repeat register within HR-N region 2 that can run continuous 
with the heptad register observed in HR-N region 3 (shown 
below the sequence in Fig. 3, top). The presence of this alter- 
nate heptad register indicates that hydrophobic residues also 
occur in the interfacial e and g positions in this location. In 
contrast, the HR-C coiled-coil region shows only a single con- 
tinuous heptad register (Fig. 3, bottom). 

Circular Dichroism Analysis of the HR-N and HR-C Re- 
gions—To examine the structural characteristics of the HR-N 
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Fic. 3. Top, sequence of the predicted HR-N region from residues 882—1011 of the SARS-CoV S protein. The a and d positions of the strongest 
predicted coiled-coil heptad repeats (abcdefg),, are shown above the sequence. The a and d positions of the alternate weaker scoring heptad repeat 
is shown below the sequence. Middle, the names and sequence regions (denoted in parentheses) of the HR-N peptides used in this study. The 
rectangle denotes the HR-N sequence from residues 882 to 1011. The hatched areas denote regions 1-3 (Fig. 2). The bars below the HR-N domain 
indicate the locations of the peptides within the HR-N region used in this study. Bottom, the sequence of the predicted HR-C domain from residues 
1147-1185. The a and d positions are shown above the sequence. The names, sequences, and locations (bars) of the HR-C peptides used in this study 


are shown below the HR-C domain. 


and HR-C regions of the SARS CoV S protein, peptides (see Fig. 
3) corresponding to various regions between residues 882 and 
1185 were synthesized and analyzed by CD spectroscopy. Fig. 4 
(A and B) shows representative far-UV CD spectra and thermal 
denaturation profiles of the HR-N1, HR-N3, HR-N4, HR-N5, 
and HR-C1 peptides. Analysis of the HR-N1 peptide, which 
represents almost the entire length of the predicted HR-N 
region, showed limited solubility in 0.1 m KCl, 0.05 m K,HPO,, 
pH 7 buffer; thus the CD study could only be carried out at 15 
pM. The CD profile of HR-N1 showed the characteristic a-hel- 
ical spectrum with double minima at 208 and 222 nm. The 
molar ellipticity at 222 nm (—15,100°) corresponds to 40 helical 
residues out of the maximum 62 helical residues induced in 
50% TFE for the molecule under the conditions analyzed. Tem- 
perature denaturation of HR-N1 showed the a-helical structure 
had high stability with 50% of its structure unfolded at 77 °C 
(Table I). 

Mapping of the HR-N region further with smaller peptides 
showed that HR-N5 and HR-N4 (regions 1 and 3) were also 
largely insoluble, and the limited amount that did dissolve (15 
pM) exhibited random coil spectra (Fig. 4A). The peptides, 
however, could be induced into an a-helical conformation in the 
presence of 50% TFE, a helix-inducing solvent, indicating they 
do have an intrinsic ability to be helical (Table I). In contrast, 
the HR-N3 peptide corresponding to the central region of HR-N 
(region 2, Fig. 2) showed good solubility characteristics (>5 
mg/ml) and exhibited a strong a-helical spectrum (Fig. 4A). The 
mean residue molar ellipticity value (—24,600°) indicated that 
HR-N3 was fully helical relative to the maximum inducible 
a-helix in TFE but only 68% helical relative to the theoretical 
value for a fully folded helical molecule of its 47-residue length 
(—36,000°). The thermal melting profile of this peptide showed 


that it had high stability similar to that of the longer length 
HR-N1 peptide (compare Ty, values of 63 versus 77°C for 
HR-N8 and HR-N1, respectively). An extension of HR-N8 pro- 
duced a peptide HR-N2 of 58 residues. This peptide had a 
similar number of helical residues (Table I) and a Ty, of 68 °C. 

We analyzed the CD spectra of several overlapping 35-resi- 
due peptides shifted 1 heptad (7 residues) toward the C termi- 
nus of the HR-N region beginning at HR-N5 and ending with 
HR-N17 (Fig. 3). All of these peptides showed mainly random 
coil spectra with the exception of HR-N12, HR-N13, HR-N14, 
and HR-N15 that displayed helical spectra profiles. Most in- 
teresting, these peptides corresponded to the same 47 residues 
of the sequence of HR-N3 that was also observed to be helical 
(Fig. 4 and Table I). It is also notable that the region in HR-N 
that was predicted to have the highest likelihood of forming a 
coiled-coil structure (Fig. 2A) corresponds with the HR-N3 se- 
quence region identified to form a stable a-helical structure. 
Most interesting, the stability of peptides (as measured by their 
Ty values) for HR-N12 (Ti, 52 °C), HR-N3 (Ty, 63 °C), HR-N2 
(Ty 68 °C), and HR-N1 (Ti, 77 °C) increased with increasing 
chain length from 35, 47, 58, and 92 residues, respectively 
(Table I). The number of helical residues in benign medium for 
these peptides also increased with chain length from 17 to 40 
residues. 

The far-UV CD spectrum of HR-C1 exhibited double minima 
at 208 and 222 nm (Fig. 4A) indicating a-helical structure. The 
mean residue molar ellipticity was —12,600° at 100 um and 
increased to —22,000° at high concentrations (>500 uM) similar 
to the value observed in the presence of TFE. Temperature 
denaturation of HR-C1 showed the peptide has low stability as 
50% of the molecule could be unfolded at 33 °C (Fig. 4B and 
Table I). 
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HR-N1, HR-N4, and HR-N5. B, tempera- 200 
ture denaturation profiles of the HR-N 

and HR-C peptides monitored by CD at 

222 nm in a 0.1 M KCl, 0.05 m PO,, pH 7 

buffer. Concentrations were 15 um for 

HR-N1 and 100 uM for HR-N3 and HR-C1 

peptides. The fraction folded (f,) of each B 
peptide was calculated as f,; = ([@] — [6],,)/ 

({6],, — [6],,), where [6] is the observed 

mean residue ellipticity at 222 nm at any 
particular temperature, and [6], and [6], 

are the mean residue ellipticities at 222 

nm of the native folded state at 4 °C and 

unfolded states, respectively. 


Fraction Folded 


10 


To see if we could define further the a-helical region within 
HR-C (39 residues), we analyzed three truncated peptides of 
HR-C1. CD analysis of HR-C2 and HR-C8 (21 and 28 residue 
length, respectively) displayed random coil spectra in benign 
buffer, whereas HR-C4 (35-residue length) showed a-helical 
structure with a [6]... value of —17,500° (Table I). Addition- 
ally, the thermal melting profile of HR-C4 indicated the same 
temperature transition midpoint (33 °C) as full-length HR-C1 
(Table I). Thus the minimal length required for folding of the 
HR-C region coiled coil appears to be 35 residues corresponding 
to the sequence 1151-1185. 
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Oligomeric State of the HR Regions—Sedimentation equilib- 
rium experiments were carried out to determine the oligomeric 
states of the HR-N and HR-C peptides. The HR-N1 peptide was 
studied at three different concentrations and three different 
rotor speeds. The data obtained fit well to a single species 
model with a weight average molecular weight of 48,750 (Fig. 5, 
left). Surprisingly, this value corresponds well to that expected 
for a tetramer of HR-N1, based on the amino acid sequence (4 X 
12,004 = 48,016 Da). The high quality of the single species fit 
at the different concentrations analyzed suggests that the 
HR-N1 peptide was exclusively tetrameric over the entire con- 
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SARS-Coronavirus Spike S Fusion Protein 20841 
TABLE I 
Ellipticity and stability of the synthetic peptides studied 
[@loo0° Helical residues® 
Peptide name* No. residues % Helix (benign/TFE)@ Ty! 
Benign 50% TFE Benign 50% TFE 
degrees:cem?-dmol 1 °G 
HR-N1? 92 —15,100 —25,800 40 62 65 77 
HR-N2 58 —21,100 —23,300 33 37 89 68 
HR-N3 47 —24,600 —21,500 32 28 100 63 
HR-N4” 38 —6,100 —31,300 34 
HR-N5" 35 —4,300 —24,200 24 
HR-N6 35 —3,300 —20,100 20 
HR-N7 35 —2,800 —24,200 24 
HR-N8 35 —2,500 —25,100 25 
HR-N9 35 —2,800 —26,800 27 
HR-N10 35 —3,600 —28,500 29 
HR-N11 35 —3,200 —25,200 25 
HR-N12 35 —16,500 —21,000 17 21 81 52 
HR-N13 35 —8,700° —20,500 21 
HR-N14 35 —7,600° —12,000 12 
HR-N15 35 —6,200° —14,900 15 
HR-N16 35 —4,800 —19,000 19 
HR-N17 35 —5,500 —23,300 24 
HR-C1 39 —13,600 —23,800 15 26 58 33 
HR-C2 21 —3,600 —15,400 10 
HR-C3 28 —5,800 —19,200 16 
HR-C4 35 —17,500 —19,800 18 20 90 33 


“ Name of each peptide studied. The sequence position of each peptide is shown in Fig. 3. 

® The mean residue molar ellipticities at 222 nm were measured at 22°C in benign buffer (0.1 M KCl, 0.05 M K,PO,, pH 7). For samples containing 
TFE, the above buffer was diluted 1:1 (v/v) with TFE. 

©The number of a-helical residues was calculated based on the predicted molar ellipticity. The predicted molar ellipticity for a completely 
a-helical peptide was calculated using []),.oretical = *40,000 (1-4.6/N), where N is the number of residues in the polypeptide chain. The number 
of a-helical residues in benign or TFE conditions was calculated as the observed [6] divided by the theoretical [6]. 

¢ The percent a-helix in benign medium at the concentration used was calculated as the number of helical residues in benign divided by the 
maximum inducible a-helical residues in TFE. Note that in oligomerizing a-helices the helical content is concentration-dependent. Higher 
concentrations shift the monomer to oligomer equilibrium increasing the a-helical content based on a two-state unfolding mechanism (random coil 
to coiled coil). 

© Ti is the transition midpoint temperature at which there is a 50% decrease in molar ellipticity [6]... compared with the fully folded peptide 


as determined by CD at 5 °C. All peptides in which Ti, was determined are in boldface type. 
’ These peptides all showed a characteristic a-helical spectrum with minima at 208 and 222 nm even though the [6]. was low. 
* Results obtained at a peptide concentration of 15 ym instead of 100 uM used for the other determinations. 
” Peptides were studied in 0.05 m K,PO, buffer, pH 7, due to low solubility. 


centration range of the experiment. This result was surprising 
considering earlier analyses of fusion proteins and ectodomain 
cores of other viruses have been observed to be trimeric (54— 
59). Our results, however, were consistent with the sedimen- 
tation equilibrium analysis of the smaller truncated HR-N 
peptide, HR-N3, which also displayed a-helical structure. Sed- 
imentation equilibrium data obtained for this peptide best fit to 
a single species model with weight average molecular weight of 
19,000, close to that expected for a tetramer (theoretical mo- 
lecular weight 20,212, 4 x 5,053, data not shown). 

In contrast, sedimentation equilibrium experiments of the 
HR-C1 peptide at three different concentrations and three 
speeds showed that the data were best fit globally to an asso- 
ciating monomer to trimer equilibrium model. Fig. 5 (right) 
shows a plot of the absorbance versus radial distance squared 
divided by 2 and the residuals of the fit for the HR-C1 peptide. 
At the highest concentration (250 mm) and highest speed 
(30,000 rpm), the HR-C1 peptide showed a weight average 
molecular weight of 13,900, slightly greater than the theoreti- 
cal mass for a trimer (13,007 Da). The deviations from the 
expected trimeric species near the bottom of the cell are likely 
due to aggregation at this high concentration (~1 mm). 

Interaction between the HR-N and HR-C—Previously, it has 
been shown that HR-N and HR-C of the murine coronavirus 
MHV §S glycoprotein self-associate to form a 6-helix bundle 
structure typical of other type 1 viral fusion proteins (19). To 
identify the corresponding sequences within the SARS-CoV S 
glycoprotein, mixtures of the HR-N and HR-C peptides were 
screened by CD spectroscopy and SEC. Initially, mixtures be- 
tween the three HR-N peptides, HR-N5, HR-N3, and HR-N4 
(which correspond to regions 1—3 of HR-N, Fig. 2), with HR-C1 


were screened. These peptide mixtures demonstrated no de- 
tectable interaction, i.e. the CD spectra were super-imposable 
with the theoretical spectra calculated for two non-interacting 
peptides. Additionally, the SEC profiles were similar to those of 
the individual peptides alone. We next analyzed the HR-N 
peptides corresponding to the intervening frameshift/hinge re- 
gions (Fig. 2). No interactions were observed for peptides HR- 
N6, HR-N7, and HR-N8. In contrast, HR-N9 showed the induc- 
tion of 3,500° at 222 nm; HR-N10 showed the induction of 
9,600°, and shifting 7 residues (1 heptad) further toward the C 
terminus (HR-N11) resulted in the absence of any observable 
interaction once again with HR-C1 (Table II and Fig. 6). Sim- 
ilar mapping results were also observed in the temperature 
denaturation profiles and SEC analysis of the mixtures. The 
mixture between HR-N9/HR-C1 showed a Ty, of 36 °C, a 3 °C 
rise relative to HR-C1 alone (Ty, 33 °C), and the SEC showed 
the formation of a higher molecular weight complex of ~30% of 
the observed area units. HR-N10/HR-C1 (Ty, 57 °C) showed a 
24°C increase in Ty from HR-C1 (Fig. 6C), and the SEC 
showed largely one single peak of higher molecular weight (Fig. 
7). The mixture between HR-N11/HR-C1 showed no complex 
peak in the SEC. It is worthy of mention that the CD signal at 
222 nm for the strongest interacting peptides (HR-N10/HR-C1) 
only showed a molar ellipticity of —17,700, ~51% of a theoret- 
ical fully helical peptide of 35 residues in length despite the 
significant increase in stability observed (data not shown). 

To confirm that the structure of the HR-N region is not a 
factor in binding to HR-C1, as HR-N10 was a random coil on its 
own (Fig. 6A), we extended the length of HR-N3 (residues 
927-973) that exhibited an a-helical structure by 11 additional 
N-terminal residues (HR-N2) to contain the complete binding 
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Fic. 5. Equilibrium ultracentrifugation analysis of the HR-N and HR-C peptides. Left side, middle, a plot absorbance versus radial 
distance squared divided by 2 for HR-N1 at 22 °C and 26,000 rpm in a 0.1 mM KCl, 0.05 m PO,, pH 6 buffer. Protein concentration was 15 pM. Left 
side, bottom, a plot of the natural logarithm of the absorbance versus radial distance squared divided by 2 for the same data. The theoretical lines 
for a single species monomer (M), dimer (D), trimer (Tri), and tetramer (Tet) are shown for comparison. The data best fit a single species model 
with tetrameric molecular weight. Right side, middle, a plot of absorbance versus distance squared divided by 2 of the HR-C1 peptide at 22 °C and 
30,000 rpm in a 0.1 M KCl, 0.05 m PO,, pH 7 buffer. Peptide concentration was 250 uM. Right side, bottom, a plot of the natural logarithm of the 
absorbance versus radial distance squared divided by 2 for the same data. The theoretical lines for monomer, dimer, and trimer are shown. The 
data were fit best to a monomer to trimer associating model in equilibrium. Residuals from both fits are shown above. 


site of HR-N10. This peptide showed strong complex interac- 
tions with HR-C1 as assessed by SEC. Furthermore, the tem- 
perature denaturation profile indicated the midpoint rose 6 °C 
from 68 °C for HR-N2 alone to 74 °C for the complex HR-N2/ 
HR-C1 (Table I and Table II). The denaturation profile was 
cooperative for this complex versus a biphasic transition profile 
observed for HR-N3 with HR-C1 which showed T'y values cor- 
responding to the individual domain Ty, values indicative of 
non-interacting helices. Thus HR-C1 forms a complex with the 


HR-N region whether the N-region is unfolded (HR-N10) or 
folded (HR-N2). 

We also analyzed the truncated peptides of HR-C1 in an 
attempt to localize the binding site for HR-N10. HR-C2 and 
HR-C3 peptides of 21 and 28 residue length showed no ability 
to interact the HR-N10 region as assessed by CD spectroscopy 
and SEC (Table II). In contrast, the HR-C4 peptide of 35- 
residues which folded independently into an a-helical structure 
showed strong complex formation with the HR-N10 peptide 
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SARS-Coronavirus Spike S Fusion Protein 20843 
TABLE II 
CD and SEC analysis of the HR-N/HR-C peptide mixtures 
b 
Peptide mixture (loos Change in molar ellipticity® Ty SEC* 
Observed Predicted 
degrees:em?-dmol—1 °¢ 

HR-N2/HR-C1 —17,900 —17,000 900 74 ++ 
HR-N3/HR-C1 —18,000 —18,000 0 - 
HR-N4/HR-C1/ —9,400 —9,400 0 a 
HR-N5/HR-C1l’ —8,500 —8,500 0 - 
HR-N6/HR-C1 —8,600 —8,600 0 - 
HR-N7/HR-C1 —7,400 —7,400 0 - 
HR-N8/HR-C1 —7,900 —7,900 0 - 
HR-N9/HR-C1 —10,800 —7,300 3,500 36 + 
HR-N10/HR-C1 —17,700 —8,100 9,600 57 ++ 
HR-N11/HR-C1 —8,000 —8,000 0 - 
HR-N12/HR-C1 —14,900 —14,900 0 - 
HR-N13/HR-C1 —10,300 —10,300 0 - 
HR-N14/HR-C1 — 12,500 —12,500 0 - 
HR-N15/HR-C1 —9,500 —9,500 0 - 
HR-N16/HR-C1 —9,000 —9,000 0 - 
HR-N17/HR-C1 —9,500 —9,500 0 - 
HR-N10/HR-C2 —3,800 —3,800 0 - 
HR-N10/HR-C3 —4,900 —4,900 0 - 
HR-N10/HR-C4 —7,900 —10,500 7,400 56 ++ 
HR-N10/HR-C 1(antiparallel)ox —9,200 76 
HR-N10/HR-C1(antiparallel)ox —3,300* 748 
HR-N10/HR-C1(parallel)ox —12,200" 57% 


“Name of the peptides in the mixture. For sequence regions see Fig. 3. Names followed by “ox” indicate peptide mixtures that have been 


covalently linked by a disulfide bridge. 


® The mean residue molar ellipticities at 222 nm were measured at 22 °C in benign buffer (0.1 M KCI, 0.05 m K,PO,, pH 7). 

© The difference in molar ellipticity from the observed and predicted values (column 2 and column 3). 

¢ Ty, is the transition midpoint temperature at which there is a 50% decrease in molar ellipticity [@],.. compared with the fully folded peptide 
as determined by CD at 5 °C. When no change in molar ellipticity (column 3) was observed, the ti, was not determined. 

° Summary of the analysis of the peptide complexes by SEC. — denotes no complex formation; + denotes weak complex formation between the 
two peptides (~30% of total peak area or less); ++ denotes strong complex formation between the two peptides (>95% of total peak area). 

’ Results were obtained for peptide complexes scanned in a 0.05 m K,PO, buffer. 

® Peptide complex concentration was 100 and 15 un. The predicted molar ellipticity for two non-interacting peptides was calculated by summing 


the two individual spectra. 


observed by CD and SEC (Table II). For example, the temper- 
ature denaturation profile of HR-N10/HR-C4 displayed a sim- 
ilar temperature midpoint and molar ellipticity observed for 
HR-N10/HR-C1 (compare 57 °C with 56 °C and —17,700 with 
—17,900, respectively). Thus the binding interaction between 
the HR-N and HR-C regions appears to be localized to residues 
916-950 in the center of HR-N and 1151-1185 within the HR-C 
region (based on the N-terminal truncations). 

HAR-N and HR-C Form a 3:3 Hexameric Complex—To deter- 
mine the ratio of the peptides within the HR-N10/HR-C1 com- 
plex, mixtures of HR-N10 and HR-C1 peptides at molar ratios 
of 1.5:1, 1:1, and 1:1.5, respectively, were pre-incubated for 30 
min and then applied to a size-exclusion column (Fig. 7). The 
earlier eluting complex was collected and applied to a reversed- 
phase C8 analytical HPLC column and eluted (Fig. 7, inset). 
Both HR-N10 and HR-C1 peptides were observed. Integration 
of the peak areas for each component and conversion of these 
values to mole amounts showed that the two peptides within 
the complex were present at a 1:1 molar ratio. 

Next, we analyzed the molecular mass of the complex. Bosch 
et al. (19) had shown for MHV-A59 that complexes of HR-1 and 
HR-2 are stable during SDS-PAGE as long as the samples are 
not heated before loading. Therefore, HR-C1, HR-N10, or 
HR-N2 peptides separately or as pre-incubated equimolar mix- 
tures were subject to Tris/SDS-15% PAGE (Fig. 8). The HR- 
N10 peptide, which CD spectroscopy had shown to be a random 
coil, migrated in the gel to a location corresponding to its 
molecular weight (~3800 Da). In contrast, the HR-C1 peptide 
and HR-N10/HR-C1 complex migrated with higher molecular 
masses. The mobility of the HR-C1 peptide ran according to a 
molecular mass of ~13 kDa (just below the 16.8-kDa marker) 
indicating a trimeric state (theoretical mass of 13,008 Da), and 
the HR-N10/HR-C1 complex migrated according to a molecular 


mass of ~24 kDa corresponding to a hexameric (6-stranded) 
complex (theoretical mass of 24,279 Da) (Fig. 8A). 

Similar results were also seen for the HR-N2/HR-C1 pep- 
tides (Fig. 8B). The HR-N2 peptide migrated as a broad band 
between ~24 and ~17 kDa with the most pronounced band at 
~17 kDa, indicating a molecular mass between tetramer and 
trimer (theoretical masses 24,512 and 18,384 Da, respectively), 
and the HR-C1 peptide migrated according to a mass of ~13 
kDa (close to a trimeric mass), and the HR-N2/HR-C1 complex 
migrated according to a molecular mass of 29 kDa, close to the 
mass of a 3:3 hexamer (theoretical mass 31,692 Da). 

Sedimentation equilibrium analysis was carried out on the 
more stable HR-N2/HR-C1 complex. As shown in Fig. 9, the 
data were fit best to a single species model with molecular mass 
of 31,500 Da, close to the theoretical mass of a 3:3 mole ratio 
(6-stranded) complex between the HR-N2 and HR-C1 peptides 
(theoretical mass 31,692 Da). 

Antiparallel Helix Orientation—To investigate the orienta- 
tion of the helices within the HR-N10/HR-C1 complex, we chose 
to form disulfide-bridged heterodimers between HR-N10 and 
HR-C1 in the parallel and antiparallel polypeptide chain ori- 
entation. The premise of this experiment was that the correct 
polypeptide chain orientation should form a tightly folded hel- 
ical structure of high solubility and greater stability due to 
elimination of the concentration dependence between the two 
peptides, whereas the incorrect orientation would form head- 
to-tail complexes, which would aggregate and show little or no 
increase in stability (Fig. 10A). To do this, we added Cys-Gly- 
Gly residues to the N terminus of HR-N10 and either the N or 
C terminus of HR-C1. The peptides were then selectively co- 
valently linked using the thiol coupling agent 2,2’-dithiopyri- 
dine (see “Materials and Methods”). 

Table II and Fig. 10 show helicity and stability of the two 
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Fic. 6. Interaction between HR-N 
and HR-C peptides. A, circular dichro- 
ism spectra of HR-N10 peptide alone. 
Spectra were recorded at 25 °C ina 0.1M 
KCl, 0.05 m PO,, pH 7 buffer. For the 
spectrum containing 50% TFE, the above 
buffer was diluted 1:1 (v/v) with TFE. B, 
CD spectrum of a 1:1 molar complex be- 
tween HR-N10 and HR-C1 peptides at 
25 °C in a 0.1 m KCl, 0.05 m PO,, pH 7 
buffer. Peptide concentrations were 80 
pM. The theoretical spectrum for two non- 
interacting peptides is shown for compar- 
ison. This spectrum is generated by add- 
ing the individual peptide spectra at the 
same concentrations. C, temperature de- 
naturation profiles of HR-C1 alone and a 
1:1 molar HR-C1 with HR-N10 complex 
monitored by CD at 222 nm ina 0.1™M 
KCl, 0.05 m K,PO,, pH 7 buffer. Concen- 
trations were 100 uM. Fraction folded was 
calculated as described in Fig. 4 legend. 
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Fic. 7. HPLC analysis of the HR- £& 5 «0 
N10/HR-C1 complex. HR-N10 (2 nmol) “=” 8 
and HR-C1 (2 nmol) were pre-incubated £ 2 200 
together for 30 min in 10 yl of running c 500 5 
buffer and then applied to a Superdex™ °o 0 
75 SEC column equilibrated in a buffer = ” ear 
consisting of 0.1 mM KCl, 0.05 mM PO,,pH7, WG 400 4 Retention Time (min) 
and a flow rate of 0.7 ml/min. The absorb- fh) 
ance peak at 13 min corresponding to the oO 
HR-N10/HR-C1 complex was collected c 
and subsequently analyzed by reversed § 300 - 
phase chromatography (inset) on an an- — 
alytical C8 Zorbax column employing a 2 
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Fic. 8. Molecular mass of the complex formed between the HR-N and HR-C regions as determined by gel electrophoresis. A, 
HR-N10 and HR-C1 peptides on their own or as pre-incubated equimolar (200 uM of each peptide) mixture were subjected to Tris/SDS-15% PAGE. 
Samples were incubated for 30 min in 0.1 m KCl, 0.05 m PO,, pH 7 buffer, and the diluted 1:1 (v/v) with 2x Laemmli sample buffer at room 
temperature and loaded into the gel. The positions of HR-N10, HR-C1, and HR-N10/HR-C1 complex are indicated on the left side of the gel, whereas 
the positions of the molecule mass markers are indicated on the right side of the gels with arrows. Note the absence of the HR-N10 band was a 
result of the peptide eluting from the gel during the staining/destaining steps. B, HR-N2 and HR-C1 peptides treated in a manner similar to that 
described in A. The lanes of the gel, peptide locations, and molecular mass markers are labeled accordingly. 


disulfide-bridged complexes. The first complex in which the 
HR-N10 and HR-C1 peptides were linked in an anti-parallel 
orientation showed high solubility (>5 mg/ml or 1 mm) and a 
temperature transition midpoint of 76 °C, an increase of 19 °C 
relative to the non-disulfide-bridged complex analyzed at the 
same concentration (HR-N10/HR-C1 complex, Ti, of 57 °C, Ta- 
ble II). In contrast, the complex between HR-N10 and HR-C1 
disulfide-bridged in the parallel orientation was largely insol- 
uble in the same buffer conditions (solubility ~15 ym). The low 
solubility was completely dependent upon the disulfide bridge 
link between the peptides, as addition of dithiothreitol to re- 
duce the disulfide bridge completely reversed the solubility 
characteristics and allowed the complex to be fully soluble at 
high concentrations once again. A temperature denaturation 
melting profile was obtained at 15 ym for the parallel complex 


and was compared with the anti-parallel complex at the same 
concentration (Fig. 10B). The temperature transition midpoint 
of the parallel complex was 57 °C, similar to that observed for 
the non-disulfide-bridged complex, indicating the interactions 
between HR-N10 and HR-C1 are likely inter-molecular, 
whereas the anti-parallel complex showed a temperature tran- 
sition midpoint of 74 °C, indicating that it was 17 °C greater 
than the opposite orientation. It is noteworthy that a CD scan 
of the anti-parallel complex at 100 pum concentration only 
showed a molar ellipticity reading of —19,200° at 222 nm (a 
fully folded a-helix of 35 residues is predicted to have a molar 
ellipticity of —34,700°). Thus the highly stable complex either 
has unfolded residues at the termini of the helices or, when in 
a complex, one or the other helix exists partly in a non-helical 
form. Taken together, however, the results suggest the HR-N 
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Fic. 9. Molecular mass of the com- 
plex formed between HR-N2 and 
HR-C1 peptides as determined by 
sedimentation equilibrium analysis 
at 22°C and 30,000 rpm. The middle 
panel (absorbance versus radial distance 
squared divided by 2) shows the best fit 
curve for a single species model indicating 
a molecular mass of 30,500 Da. The theo- 
retical mass for a 3:3 mole ratio HR-N2/ 
HR-C1 hexamer is 31,692 Da. The upper 
panel shows the residuals from the curve 
fit, and the lower panel shows the In ab- 
sorbance versus r?/2 of the data compared 
with theoretical 1:1 dimer (D), 2:2 tet- 
ramer (Tet), and 3:3 hexamer (Hex). 
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and HR-C regions associate in an anti-parallel manner in the 
hexameric complex state. 


DISCUSSION 


Here we used CD spectroscopy, SEC, and sedimentation 
equilibrium analysis to characterize the predicted HR-N and 
HR-C regions in the ectodomain of the spike S glycoprotein of 
the SARS-CoV. In isolation, the HR-N region folds as an inde- 
pendent a-helical coiled-coil structure of high stability, but 
with only ~40 helical residues of a possible 92 residues. Anal- 
ysis of smaller peptides of HR-N showed that the helical con- 
tent is largely localized to residues 927-973 in the central 
portion of HR-N and the N and C termini were unfolded. It is 
interesting that the N-terminal region closest to the predicted 
fusion peptide sequence (residues 851-882, Fig. 2, a region of 
high sequence similarity to the fusion peptides of other type 1 
fusion proteins, which is thought to insert into the target mem- 
brane during the fusion process) did not show any a-helical 
coiled-coil structure, indicating this region does not spontane- 
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ously fold into a coiled coil and therefore requires other condi- 
tions or parts of the molecule. This is similar to observations in 
influenza HA, Ebola virus GP2, human T-cell leukemia virus 
type 1, gp21, and RSV F fusion proteins. In the case of influ- 
enza HA, the N terminus of the HA, coiled coil at neutral pH 
exists in part as an extended conformation but undergoes a 
conformational change to a more stable elongated coiled coil 
only when HA encounters a low pH acid environment in a 
series of steps termed the spring-loaded mechanism (60-68). It 
should be noted that we did analyze the HR-N1 peptide at 
lower pH (4.5) and did not observe any significant increase in 
helicity (data not shown). Moreover, CoV S proteins are gener- 
ally stable at acid pH, and MHV S can be triggered to its 
fusogenic state at pH 8, 37 °C (28). 

It is noteworthy that studies on the HR regions of the S 
protein of the mouse hepatitis virus strain A59 showed the 
a-helical content of the separate HR-N and HR-C peptides were 
calculated to be ~89%, and that the helical content of their 
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Fic. 10. Helix orientation in the 
HR-N/HR-C complex. A, schematic rep- 
resentation of the use of selective disul- 
fide bridge formation to form parallel and 
anti-parallel hetero-stranded polypep- 
tides, and the effect the relative helix ori- 
entations could have on the final folded 
state of the complex. The HR-N helix is 
represented as a white rectangle and cor- 
responds to residues 916-950. The HR-C 
helix is represented as a black rectangle 
and corresponds to residues 1147-1185. 
The disulfide bridge between two cysteine 
residue side chains is denoted by S-S. B, 
temperature denaturation profiles of par- 
allel and anti-parallel disulfide-bridged 
HR-N10/HR-C1 complexes in 0.1 mM KCl, 
0.05 M PO,, pH 7 buffer monitored by CD 
at 222 nm. Peptide concentrations were 
15 pM. Fraction folded was calculated as 
described in Fig. 4 legend. The tempera- 
ture melting profile of the non-disulfide- 
bridged HR-N10/HR-C1 peptide is shown 
for comparison. 
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equimolar mixture was calculated to be ~82% (19). Further- 
more, electron microscopy showed rod-like structures of ~14.5 
nm correlating well with the length predicted for an a-helix the 
size of HR-N (96 residues). Because the sequences between 
coiled-coil domains of the SARS-CoV Urbani strain analyzed 
here and MHV-A59 have a high degree of sequence identity, 
this difference was quite surprising. One possibility for the 
difference may be due to the buffer conditions in which the 
samples were analyzed. The MHV-A59 peptide was analyzed in 
H,0 versus physiological conditions used in this study (100 mm 
KCl, 50 mm K,HPO,, pH 7 buffer). Alternatively, the difference 
could be due to the low concentration at which we analyzed 
SARS HR-N1 (15 uM). The a-helical contents of peptides that 
form coiled-coil structures are affected by concentration due to 
the monomer-oligomer equilibrium (two-state folding mecha- 
nism random coil to coiled coil). It should be noted, however, 
that we observed a strong a-helical coiled coil in HR-N between 
residues 927 and 973, and this region correlates well with the 
protease-resistant portion of the HR-N region determined by 
Bosch et al. (19), and MHV S residues 976-1040 correspond to 
residues 909-973 of the SARS-CoV S protein sequence. 
Sedimentation equilibrium analysis showed that both the 
long and short folded HR-N region peptides associated as a 
tetramer; however, in SDS-PAGE the shorter HR-N region 
(HR-N2) displayed a smear between tetramer and trimer with 
the more dominant band displaying a trimeric molecular 
weight. This difference may indicate that the HR-N region 
naturally associates as a trimer, but in the absence of the HR-C 


20847 


ar 


folding 
o° (ss 


Head-to-Tail Aggregation 


Antiparallel 


Tightly Folded 


HR-N10/HR-C1 
Antiparallel,, 


Q 
on Ahi eK 
LEESON 


30 40 50 60 70 80 90 


Temperature (°C) 


region (or other parts of the native sequence) the hydrophobic 
residues occurring in the e and g positions are exposed. To 
sequester these residues from the aqueous environments, the 
protein switches from a trimer to a tetramer state. In agree- 
ment with this proposal is the observation that more interfacial 
surface areas, in particular the e and g positions, are buried in 
a tetrameric coiled-coil structure relative to a trimeric or di- 
meric coiled coil (64). 

The HR-C region in isolation showed a very concentration- 
dependent a-helical coiled-coil structure that at a very high 
concentration (>500 pM) indicated only ~24 helical residues. 
The truncation studies showed that the N-terminal four resi- 
dues could be deleted without adversely affecting the structure. 
Additionally both sedimentation equilibrium and gel electro- 
phoresis showed that the entire HR-C region alone associates 
as a homotrimer. Consistent with the helicity data, stability 
analysis showed that HR-C1 was of low stability with 50% of its 
structure unfolded at 33 °C. This and the above characteristic 
are surprising, considering that 9 out of the 10 heptad a and d 
core positions contain large optimal hydrophobic residues (e.g. 
Tle, Ile, Val, Ile, Ile, Leu, Leu, Leu, and Leu), and no apparent 
intermolecular electrostatic repulsions can be predicted. The 
low stability, however, does fit with the general mechanism of 
type 1 viral fusion proteins, where the C-terminal HR must 
separate in order to form a complex with the N-terminal HR. In 
agreement with this is the observation that there are no pos- 
sible inter-chain electrostatic attractions that can occur be- 
tween peptide chains. Such electrostatic interactions have been 
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shown to add considerable stability to coiled coils and stabilize 
their ability to stay associated (65). 

Upon mixing the HR-Cl peptide with several truncated 
HR-N region peptides, HR-C1 associated preferentially with 
HR-N10 to form a stable a-helical 6-stranded complex, consist- 
ing of a 3:3 mole ratio of HR-N10:HR-C1. Additionally, the 
selective disulfide bridge experiments indicated that the rela- 
tive orientation of HR-N10 and HR-C1 peptides in the complex 
were anti-parallel relative to one another. This orientation is 
consistent with that observed for other type 1 fusion-active core 
complexes (25-36). Furthermore, the peptide mapping study 
also revealed that a 35-residue length is optimal for the binding 
interaction with HR-Cl, as shifting either 7 residues toward 
the N or C terminus of HR-N (HR-N9 or HR-N11) caused 
significant loss in the ability to interact with HR-C1. In gen- 
eral, mapping of the HR-N10 region as the primary site of 
interaction with HR-C1 appears to be entirely consistent with 
the location recently identified in the MHV-A59 coronavirus 
strain. Bosch et al. (19) identified MHV S residues ~969-1014 
as the site of interaction with HR-C. The SARS-CoV HR-10 
peptide corresponds to residues 984-1013 of MHV by sequence 
alignment. Moreover, the strong conservation in location of the 
HR-C-binding site between the two coronaviruses, despite 
slight sequence differences in HR-C, likely reveals the impor- 
tance of this site for the coronavirus fusion mechanism. 

Most interesting, despite CD analysis of the HR-N10/HR-C1 
complex showing a significant increase in thermal stability and 
a-helical induction, the complex did not adopt a fully helical 
structure (—17,700° molar ellipticity observed). Some loss can 
always be attributed to end fraying and concentration depend- 
ence (66—68). However, when HR-N10 and HR-C1 were disul- 
fide-bridged, removing the concentration dependence between 
these two peptides, the molecule only displayed —19,200° mo- 
lar ellipticity. One possible explanation may be that the inter- 
action between HR-C and HR-N10 is not fully helical but more 
akin to the binding interactions observed for HR-C to HR-N of 
the F protein of the paramyoxovirus SV5 (25). The structure of 
the fusion-active core region of the SV5 F protein shows that 
the HR-C (HR2) domain contains both an a-helix and an ex- 
tended conformation that stretches out along the HR-N (HR2) 
region. Whether this is also the case for S of SARS-CoV or not 
will be best resolved with subsequent high resolution struc- 
tural studies. 

It is generally accepted that viral fusion proteins can adopt 
at least two different conformations (i.e. native and fusogenic 
states) and that the change in conformation is required for 
fusion of the viral and cellular membranes during virus entry 
(24). For the HA protein of influenza virus, HA is most stable in 
its fusogenic state, whereas HA in its native state is meta- 
stable, and formation of the more stable helical hairpin com- 
plex provides the energy to overcome the activation energy 
needed to bring the two lipid bilayers close together at the 
fusion site (60, 62, 63). The crystal structure of a large number 
of other viral fusion proteins in the more stable fusogenic state 
(25-36) confirms many aspects of this proposed model. Because 
the a-helical heterotrimeric (6-helix) complex consisting of HR- 
N10 and HR-Cl1 peptides of SARS-CoV S protein displays 
greater stability than the separate domains, we speculate that 
this 6-helix bundle structure represents the core of the fusion- 
competent state of the SARS-CoV S protein. 

Finally, the observation of the 6-helix bundle conformation 
suggests that peptides corresponding to the SARS-CoV S pro- 
tein HR regions should efficiently inhibit viral fusion as ob- 
served with other type 1 viral fusion proteins (19, 37-46). In 
these cases, free HR-N or HR-C peptides compete for binding to 
the HR-N and HR-C coiled coils in the native state thereby 
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blocking the conformational transition to the fusion-active 
form. Analysis of the inhibitory effects of the SARS-CoV S 
peptides is now in progress. 

Taken together, these studies of the SARS-CoV HR regions 
support the emerging structural theme for type 1 viral fusion 
proteins. In the native state, the two HR regions of the fusion 
protein ectodomain oligomerize to form homotrimeric coiled- 
coil cores, which when stimulated by receptor interaction un- 
dergo a conformational change to an antiparallel 6-stranded 
a-helical structure (trimer of dimers) in the fusion active state 
to initiate the fusion of viral and cellular membranes. 
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